FROM SPLINE APPROXIMATION TO ROTH'S EQUATION AND SCHUR 

FUNCTORS 



JAN MINAC AND §TEFAN O. TOHANEANU 

Abstract. Alfeld and Schumaker provide a formula for the dimension of the space of piecewise 
polynomial functions, called splines, of degree d and smoothness r on a generic triangulation of a 
planar simplicial complex A, for d > 3r + 1. Schenck and Stiller conjectured that this formula 
actually holds for all > 2r + 1. Up to this moment there was not known a single example where 
one could show that the bound d > 2r + 1 is sharp. However, in 2005, a possible such example was 
constructed to show that this bound is the best possible (i.e., the Alfeld-Schumaker formula does 
not hold if d = 2r), except that the proof that this formula actually works if d > 2r + 1 has been 
a challenge until now when we finally show it to be true. The interesting subtle connections with 
representation theory, matrix theory and commutative and homological algebra seem to explain why 
this example presented such a challenge. Thus in this paper we present the first example when it is 
known that the bound d > 2r + 1 is sharp for asserting the validity of the Alfeld-Schumaker formula. 



1. Introduction and background 

Let A be a connected finite simplicial complex whose geometric realization | A| is a topological 
disk in M?. Let r > be an integer. The space of splines of smoothness r and degree d is the R— 
vector space 

C^(A) = {F : |A| — yR: F\^ = polynomial of degree < d, 

Va G As, andF G C'}. 

A very nice accessible introduction to Polynomial Splines is [5, Chapter 8]. 

One of the major questions in spline approximation is to find the dimension of this vector space; 
even when d = 3 and r = 1, this dimension is not known for arbitrary triangulations. If d > 3r + 1, 
for almost all triangulations, Alfeld and Schumaker (0]) give a beautiful, yet complicated formula 
for this dimension in terms of combinatorial and local geometric data (data depending only on 
local geometry at the interior vertices of A): 
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where is the number of interior edges of A, /q is the number of interior vertices of A, and 
a = where cTj = J2j>i niax{(r + 1 + j(l — n{vi))), 0}, and n{vi) is the number of distinct 

slopes at the interior vertex t>j. For further reference this formula will be denoted by L(A, r, d). 

In [15J, by showing that a certain zeroth local cohomology is zero, Schenck and Stillman 
prove that if A has only pseudoedges (such a triangulation is called quasi-cross-cut), the Alfeld- 
Schumaker formula is true for any d. With different methods, in [fTTl the same is true, but for the 
more general case when instead of a triangulation, one has a partition. Lemma 2.5 in [|T5l also says 
that if A has at least one non-pseudoedge, the local cohomology module considered is not zero. 
In fact, Schenck and Stiller conjectured that for any A, this local cohomology module vanishes in 
degree d > 2r + 1. 

[fT4l considered a simplicial complex As with exactly one non-pseudoedge, and in |fT9l it was 
shown that for this particular example, the above conjecture is tight: for any r > 1, dim 6*2^ ( A5) 7^ 
L(As,r, 2r). The present notes are a foUowup of [19|. We show that for the same simplicial 
complex As, 

dimC^(A5') = L(As, r, d), for any r and d > 2r + 1, 

and therefore the Schenck-Stiller conjecture is true for this first non-trivial triangulation A5Q. 

The confirmation of the Schenck-Stiller conjecture in this case proved to be surprisingly chal- 
lenging, and in fact, it took several years to establish the main result of this paper. The reason for 
this difficulty seems to lie in rather deep connections with representation theory, matrix theory and 
commutative algebra. 

The proof is subtle in several places, but we put considerable effort into making our exposition 
clear and readable also for the non- specialists. In the first part we provide the solution of the 
main problem that relies on classical concepts in commutative algebra (e.g., regular sequences, 
monomial order, etc.). In the second part of the paper we investigate the connections with Schur 
functors. Roth's equation in matrix theory and lower-upper triangular matrix decompositions. 

1.1. Homological approach to spline approximation. Following flSl, consider embedded in 
M'^, and let A be the cone of A with its origin in and let i? = M[x, ?/, 2] be the ring of polynomials 
in variables x, y, z with coefficients in M. Since we consider the cone of A, from now on if e is 
an edge of A we will think of to be the homogenized equation of the equation of the line in 
where e is placed; also, abusing the terminology a bit, we are going to say that the linear form 
defines the edge e. 

Consider the finitely generated graded /?— module: 

CiA) = {F : I A| ^ M : F\^ G i?, Va G A2, and F e C}. 

Then 

dimKQ(A) = dimKC"-(A),, 

the dimension of the degree d piece of graded module C^{A). So, by taking A the cone of A, we 
homogenized our polynomials and therefore the problem is translated into a homological algebra 
problem: to find the Hilbert function of a graded module B 

^See Subsection 1.2. 

If M = (BdezMd is a finitely generated graded i?— module, the Hilbert function of M in degree d is by definition 
HF{M, d) — diniR Md- See, for example, li 1 81 for more background on Hilbert functions. 
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Let F E C""(A)rf. Piecewise, on each triangle Tj of A, F is defined by a homogeneous poly- 
nomial of degree d: Fi E R^. For F to be a C — function, since polynomials are C°°— functions, 
whenever we have two triangles Tj and Tj, with a common (interior) edge of equation iij = 0, then 

F,-F,E 

For example, if r = (i.e., F is continuous), one must have Fi(P) = Fj(P), for all P E V{iij). 
But this means exactly that Fi — Fj E {iij). 

With the above idea in mind, Billera and Rose (L4J) place C"'(A) in the following exact complex 
of graded i?— modules: 

^ C''{A) © Rf'{-r - 1) 4 Rf' ^ ^ 0, 

where = coker^cp), and 

/ ^:t' \ 

V / 

82 is the simplicial reduced boundary map R^'^ — )■ _R^i , where /2 is the number of triangles of A 
and is the linear form that defines the interior edge Cj. 
Using the properties of the Hilbert function, we obtain 

dim. C'(A). = A + ^ /? /f + HF(N, S). 

In [fT6ll . Schenck and Stillman place the graded i?— module N in the following short exact se- 
quence of graded i?— modules: 

^ i/(°,,,,,)(iV) ^ iV -> ®,^^oR/J{v) ^ 0, 

where ^ (A^) is the zeroth local cohomology module of at the maximal ideal (x, y, z), Aq 
denotes the set of interior vertices of A, and 

J(t;) = (Ci\...,C(.))' 

with j being the linear forms defining the interior edges of different slopes with one of the 
vertices being v. 

Theorem 3.1 in [fTSl describes the minimal graded free resolution of R/J{y), so 
HF{R/ J{v)), d) is known. In Corollary 4.5 in IfTSl all of this information is combined to ob- 
tain 

dimKC^(A), = L{A,r,d) + HF{H'^^^y^^^{N),d). 

Since ^ (A^) is a module of finite length, we have for d sufficiently large 
HF{H^^ y ^^{N), d) = 0. In fact, Alfeld and Schumaker show that it is enough to take d >3r + 1. 
The Schenck-Stiller conjecture claims that one can take d > 2r + 1. 
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1.2. The Schenck-Stiller example. In this section we reduce our problem to calculating the 
Hilbert function of some ideal in degree r. We will provide a detailed picture of our approach 
which is also accessible to a non-specialist in this area. 

Let R = M.[x,y, z] be the ring of polynomials with real coefficients and let := A be the 
following simplicial complex that triangulates a bounded connected region in the real plane: 



m (2,2) (4,2) 




(0,0) (2,0) (4,0) 



The goal is to compute dimR C"'(A)d, d > 2r + 1 by showing that HF{H^^^ ^ (N), d) = 0,d> 
2r + 1. 

For As, because it has exactly one totally interior edge, H? . (N) is isomorphic to R/I, where 

I 1 

/ is a certain ideal (see the assertions before Note II. II) . and therefore it is enough to prove that 
HF{H^^ y 2r + 1) = 0. Since our triangulation is specific, our analysis will be more direct, 

building on basic definitions which, via careful considerations, will lead us to our results. 

Let d = 2r+ 1 , and let us label the triangles of A with Ti , . . . , Tg, clockwise: for example the first 
triangle has vertices (0, 2), (2, 2) and (1, 1). Let £ij be the linear form defining the common edge 
between the triangles Tj and Tj. We have 9 interior edges, one of them being a non-pseudoedge 
(i.e., the edge common to the triangles T2 and Tg; this edge is placed on the line in of equation 
y = I, hence, after homogenization, £26 = y — z): 



^12 = i78 = x-y 

^23 = h5 = x + y - iz 

hi = ^6 = x-y- 2z 

47 = ii8 = x + y - 2z 

^6 = y- z. 

An element F e C"'(A)d is an 8-tuple: 



F=iF,,...,Fs), 



where Fi G Rd are subject to the conditions: 
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pr+ 
^23 


G23 




— Fa — 


«r+ 
^34 






-F, = 


^45 


»-^45 


F, 


-F, = 






Fg 


-Fj = 


/""+ 


1^ 


Fr 


-Fs = 


*78 


1^ 

<J"78 


Fs 


-F, = 


^18 


tJ18 


F2 


-Fe = 


^26 


1^ 



where Gij are homogeneous polynomials in R of degree d — (r + 1) = r. 

One should observe that once we know Fi and the Gj/s, F is completely determined. The Gi/s 
are subject to the following relations: 

{G12 + G'/s)(!u'^ + (^67 + ^18)^67^ + '^26^26"'" = 0; 
( — — Gi5)i2'^'^ + ( — 6*34 — G^q)^^'^'^ + G2e^2Q^ = 0- 

From the discussions above, considering first the possibilities for Fi, then the possibilities for 
Gi/s with 6*26 fixed, and finally the possibilities for G26 we see that the dimension we are looking 
for is 

di,„c'(A).=f''rv4r''t'v^. 

where e = dim L with 



2 \ 2 



~ V'12 ) *-67 / ■ ^26 ' ' \^23 ' ^34 / • '^2Q ■ 

The Alfeld-Schumaker formula says that 



^ 2 J V 2 y VV2/ V2 
and from the proof of Corollary 4.5 in [|T5l . 

r + 1 

(T = 2ra — 2c? , where a = [ — ^ — J , 

the smallest integer larger than ^ . 
We have d = 2r + 1. 

If r + 1 = 2n, then a = 2r? — 2n. For this case 

dmvC^{k)d- L{^,r,d) = t - n. 

If r + 1 = 2n + 1, then a = 2r?. For this case 

dimC"^(A)d-L(A,r,d) =e-(n + l). 

So the Schenck-Stiller conjecture is true for this A, if one can show that e = nifr + l = 2n, 
and e = n + 1 if r + 1 = 2n + 1. Our goal is to prove these equalities. 
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In the language of the previous subsection, one needs to show that 

HFiHl^^,^iN),2r + l) = 0. 
Indeed our goal matches this new goal as we can see below. 



First, by making the change of variables suggested in [|T9ll : y—z = ^y, x—y = x, x—y—2z = z, 
we can assume that £26 = y, ^12 = x, Iqj = x + y and £34 = z, £23 = x + z. So the ideals J{vi) at 
the two interior vertices become: 

J{v,) = {x^^\{x + yY^\y^~''), 

and 

J{v2) = {z^^\{z + yy^\f^'). 
Second, by [fT5l , Theorem 3.1, the graded minimal free resolution for R/ J{vi), z = 1, 2 is 

^ R{-a) © R{-b) % R\-r - 1) ^ R ^ R/J{vi) 0, 

[A, DA 

where a and b are some shifts (see Note ll.ll below) and ^/'j = ( -Bj Ei . For example, if i = 1 

this means that + Bi{x + yY+^ + Ciy''+^ = and D^x^'^^ + Ei{x + yY+^ + Fiy^+i = 0. 

We should note that 

{C,,F,) = {x^^\{x + yy^'):f^\ 

and 

{C2.F2) = {f^\{z + yy^'):f^\ 

Third, by [[H, 

HUA^) ^ ^(-^ - ^1' ^2, F2). 

With this, the Schenck-Stiller conjecture for this A reduces to showing that 

HF{R/{C^,F,,C2,F2),r)=0. 

Note 1.1. By IBl, Theorem 3.1, if r + 1 = 2n, then degid) = deg(Fi) = n,i = 1,2, and if 
r + 1 = 2r;, + 1, then deg(Ci) = n, deg(Fj) = n + 1, z = 1, 2. Furthermore, the ideals (Cj, Fi) are 
complete intersections (see lfT9l ). 

We have an exact sequence of R— modules: 

R R R R 

^ {c„ Fi) n {C2, F2) ^ ® ^ (Ci, Fi, C2, F2) ^ ■ 

Since (Cj, Fj), 2 = 1, 2 is a complete intersection, we have the graded minimal free resolution 

^ i?(-(deg(a) + deg(F,)) ^ deg(a)) © R{- deg(F,)) i? ^ R/{Ci, F,) ^ 0, 
and the Hilbert function of i^^^'p^^ © (^c^p-z) computed using these resolutions. 
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Therefore, to prove the claim it will be enough to show 

2n2, ifr = 2n-l; 

2n(n + l), ifr = 2n. 

Equivalently, our main goal is to prove 

Theorem 1.2. 

ffF((C,.F.)n(C..f.),r)=.= {^-^^_ '^Ztr'' 
2. Proof of the main result 

Let us denote 

K(r) = (({x^+\ (x + yr') n {z^+\ (z + yr')) : 
We want to show that 

dim K{2n — 1) — n and dim K{2n) = n + 1. 

We have the following sequence of useful lemmas. 

Lemma 2.1. For any n > 1 we have 

dimi^(2n — 1) > n and dimi^(2n) > n + 1. 

Proof. Since HF{jp^-^^^^^y^, r) > 0, then from the exact sequence above we have HF{R, r) — 
dim K{r) < 2HF{ , r) and hence the result. □ 

We have 
and 

Observe that Ci, Fi are polynomials in variables x and y so they are elements m A — y\. 
A is a subring of i? = W[X,y,z\. We will denote by {Ci,Fi)A the ideal in A generated by 
Ci, Fi, and we will denote by (Ci, Fi)R the ideal in R generated also by Ci, Fi. We also have 
(Ci,Fi)i?n/l = (Ci,Fi)A 

Similarly, C2, F2 are polynomials in variables y and z, and so they belong Xo B — C R. 

Again, (C2, -^2)^ will denote the ideal in B generated by C2, F2, and (C2, -F2)-R will denote the 
ideal in R generated by 6*2,^2. 

Lemma 2.2. Let F e K{r). Then 

(1) If we write F = z''/o+^''"7i + - ■ ■+zfr-i+fr with fieA = R[x, y], then fi G (Ci, Fi) A 

(2) If we write F = x^gg + x^~^gi + • • • + xgr-i + gr with gj G B = M.[y, z], then gj e 
{C2,F,)B. 
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Proof. It is enough to prove the first part, since we can switch the roles of x and z afterwards. Let 

F = + z'-^h + • • • + zfr-i + fre Kir). 
Since F e (Ci, Fi)R, substituting 2; = we obtain fr G (Ci, Fi)A. Therefore 

z{z'-'fo + --- + fr-i) e {C^,F^)R. 

Since (Ci, Fi)A C A is a complete intersection, then {z, Ci, Fi} is an i?— regular sequence, and 

therefore z'^~^fo H h fr-i G (Ci, Fi)R. Again, by making z ~ 0, and with the same argument 

as above, we obtain inductively that for alH = 0, . . . , r, one has fi e (Ci, Fi)A. □ 

A useful consequence of the proof of the above lemma is that if F e K{r) and if x°'y^z'^ is a 
nonzero monomial in the expression of F, since the ideals (Q, Fj) , i = 1, 2 are generated in degree 
> n, then a <r — n and c <r — n. 

The key result is the following lemma. For convenience we assume that has any degree. In 
this section most of the arguments use induction on r (the base cases are simple calculations, and 
we left them out of the notes), and the third part of this next result will help resolve the inductive 
step. 

Lemma 2.3. The following statements are true: 

(1) e K{r). 

(2) For any j < r - 1, ^ {x'-+\ {x + yY+'^) : y'-+^ andy^ ^ {z''+^, {z + yY+^) : y''+^. 

(3) IfF e K{r), then y|£ = yF^, e K{r - 1). 

Proof Let / = {x + yf^^) : = (Ci, Fi) C ^ = y]. 

(1) . From the exact sequence 

^ A(-(deg(Ci) + deg(Fi)) ^ A{- deg(Ci)) © A{- dcg(Fi)) ^A^ A/{C^, F,) ^ 0, 
we have 

HF{A/I, r) = r + 1 - (r + 1 - deg(Ci) + r + 1 - deg(Fi)) + (r + 1 - deg(Ci) - deg(Fi)) = 0, 
so y'^ e /. Similarly, replacing x with z, we have y'^ e i^(r). 

(2) . Let j < r — 1. If y^ e (a;''+\ {x + 1/)''+^) : t/''"^^, then there exist P,Q e A of degree j such 
that 

Taking the partial derivative with respect to x we obtain 

= ((r + 1)P + xP,)x' + ((r + 1)Q + {x + y)Q.){x + yf. 

Therefore (r + 1)P + xP^, which has degree j, is a multiple of {x + yy. But 3 < r — 1, and 
therefore 

(r + l)P + xP^ = 0. 

If P = ^ aiX^y^^\ we have 

ao = and (r + 1 + i)ai = 0, i > 1. 
Hence F = 0, and therefore we have a contradiction. 
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(3). Let F e K{r). So there exist P,Q e R such that 
Taking the partial derivative with respect to z we have 

Taking the partial derivative with respect to x we have 

y'XyF,,)e{x\{x + yy). 

Similarly, we have 

f{yF,^)e{z\{z + yy). 
Since deg{yFxz) = l + r — 2 = r — l,we indeed obtain that yF^^ G K{r — 1). □ 

Proposition 2.4. We have 

K{r) = W:={fe (x + yY+') : f+Wf{x, y, z) = f{z, y, x)}. 

Proof. For convenience denote with f{x,y,z) = f{z,y,x). 

The inclusion "3" is immediate, by switching x and z, so dim K{r) > dim W. 

For the other inclusion, consider {/i, . . . , fm} to be a basis for K{r). Since /j G K{r), then 
fi G -ft'(r), and therefore gi := fi + fi E K{r). Since (^j = (^j, and since 

g,eK{r)(l{{x^^\{x + yY+'):f^\, 

we have that G W,i = 1, . . . ,m. 

Suppose that these gi are linearly dependent. Then there exist ci, . . . , G M not all zero, such 
that 

Cl(/l + /l) + ■ ■ ■ + C„(/„ + fm) = 0. 

We have F := ci/i + ■ ■ ■ + Cm/m G -ft'(r) and F = —F. We show by induction on r that in 
these conditions F = 0. From Lemma [23] (3). we have yF^z G K{r — 1). Also yFa;^ = —y{F)xz, 
and by induction 

yF^z = 0, 

which means 

F = P{x,y) + Q{y, z) = a„(a;'-^-z^-'^)i/'^ + a„+i(x^-"-i-z^-"-i)y"+i + - ■ ■ + ar.i{x-z)f-\ 
From Lemma [2!2l we have that for j = n, . . . , r — 1, 

a,y^ &{f^\{z + yr^'):f^\ 

and therefore from Lemma [23] (2). Oj = 0. 

This means F = 0, and since /i, ...,/„ are linearly independent, all the Cj must vanish. But 
this implies that dim W > m = dim K{r), and with the previous inclusion we obtain the desired 
equality. □ 

Proposition 2.5. The ideal 

X{r) := {{x^+\ {x + yf^') : f^') H {{z^+\ {z + yf^') : 
is minimally generated in degree r. 
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Proof. We show by induction on r > 2, that (lim.^X{r)r-i = 0. 

Let F G X{r)r-i. The same proof as for Lemma [23] (3). gives us that 

yF^, e X(r - 1),_2. 

By induction this must vanish and therefore 

F^, = 0. 

So 

F = ar-ix'"^ + ar-2x''~'^y H \- aixy''-"^ + aoy''~^ + bizy''""^ H h K-iz''^^ G X(r). 

A similar proof as for Lemma [Z2l will yield 

ar-i, ar-2Z/, ■ • • , ai/~' e (^'■+\ {z + yy+') : y'+K 

These must vanish because of Lemma [23] (2). Similarly, hi = ■ ■ ■ = 6,-1 = 0. So F = aoy'^~^ E 
(x + yY'^^) : and this must vanish as well, from the same lemma. □ 

Now we can prove our desired equalities. 

Theorem 2.6. For all n>2, 

dimi^(2n — 1) = n. 



Proof. From Lemma [ZTl it will be enough to show that dim K{2n — I) < n. 
As we denoted before, let 

I{2n - 1) = i{x'\ (x + yf-) : y'^ H {{z'"^, {z + yf^) : y^"). 

Then we need to show that 

HF{I{2n-l),2n- 1) < n. 

Let >- be the Graded Reverse Lexicographic order on the monomials of R = M.[x,y, z], with 
X >- y >- z. Let 

iny{I{2n - 1)) 

be the initial ideal of X{2n — 1) with respect to y. 

From Proposition 12.51 X(2n — 1) is minimally generated in degree 2n — 1. Also, since by 
Proposition [231 the polynomials of degree 2n — 1 in X(2n — 1) are symmetric in x and z, we have 
that 

{y^^-\ x/"-2, . . . , ^n^Y, xzMu xzMp} 

includes the monomials that generate in^(X{2n — 1)). Here we also used the remark after Lemma 
12.2! that the power of a; in a nonzero monomial of an element of degree 2n — 1 inX(2n — 1) is 
< r — n = 2n — 1 — n = n ~ 1. 

Next we show that if xzM is the leading monomial of an element F E X(2n — 1), then deg(F) > 
2n. Let F E X{2n - l)2n-i = K{2n - 1) be such that in^{F) = xzM. By Proposition [Ml F is 
symmetric in x and z, so 

F = xzG, 

for some G E R. 
We have 

= Ax^" + B{x + and ^^"(x^G) = Cz^"" + D{z + y)^". 
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We have {z, x^", (x + y)^"} and {x, z^", {z + y^"} are i?— regular sequences so 

y^'^ixG) = A'x^"" + B\x + yf" and |/^"(zG) = C'z^'" + D\z + yf". 
From this we have that x\B' and z\D' , and therefore 

^2n^ = + 5"(a; + y)^'^ and ^^"G = C"^^"-^ + D"{z + y)^". 

Taking the partial derivative with respect to y of the two equations above, we obtain 

l/2"-i(2nG + yGy) G (x + y)^""^) and y2«-i(2nG + yG^^) G (^^^'S + 

We obtained that 2nG + yGy is an element of degree 2n — 3 in X{2n — 2), or is equal to zero. 
So from Proposition l2.5l 

2nG + yGy = 0. 

If G = ^ ttij^kX^y^ z'', then (2n + j)aij^k = 0, and so G = 0. 

We obtained that the leading monomials of elements of degree 2n — 1 in I(2n — 1), belong to 
the set {?/^"^\ xy^""^, . . . , x"~^y"}, and therefore 

HF{X{2n- l),2r2- 1) < n. 

□ 

Theorem 2.7. For all n > 2, 

dimK{2n) = n + 1. 

Proof. From Lemma [ZTl it will be enough to show that dim K{2n) < n + 1. 
Since y^" G K{2n), from Lemma [23] (1). we can find a basis for K{2n): 

{y^^,H,,...,H^}. 

Supposem > n + 1. From Lemma [23] (3), y{Hi)xz, ■ ■ ■ ,y{Hm)xz G K{2n — \). From Theorem 
I2.6[ we have dim_ft'(2n — 1) = n and therefore, these elements must be linearly dependent. So 
there exist constants ci, . . . , Cm G M, not all zero, such that 

ciy{Hi)^z H h Cmy{Hm)xz = 0. 

This implies that 

{CiHi + ■ ■ ■ + CmHm)xz = 0. 

So H := CiHi + ■ ■ ■ + CmHm G K{2n), which is symmetric in x and z from Proposition [2]4l is 
of the form 

H = a2„(x2" + ^2") + . . . + a2(x' + z^)y^^~^ + ai(x + + aoy'" 

Using Lemma [Z2[ and Lemma [23] (2). we obtain 

Oi = ■ ■ ■ = = 0, 

which leads to the linear dependency 

-aoZ/^" + CiHi H h CmHm = 0. 

This is a contradiction since {?/^", Hi, ... , Hm} is a basis for K{2n). We obtain that m <n which 
proves the theorem: 

dimK(2n) <n + l. 
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□ 

3. Connections with Schur functors and Roth's equation 

Our initial approach to prove Theorem 11.21 was more direct: consider an arbitrary element in 
(Ci, Fi)r and we require it to belong to (C2, F2)r', this will lead to a comparison of polynomial 
coefficients. 



Let 

i+j+k=r 

be an element in (Ci, Fi) fl (C2, F2) C R = ]R[a;, y, z] of degree r. 
We can write 

(*)F = zJo + z'^-'h + --- + zfr-l+fr 



where for /c = 0, . . . , r, 



fk — O.0,k,r-ky + 0'l,k-l,r-kXy + ■ ■ ■ + Ofc 

fi,r—k-^ 

is a homogeneous polynomial of degree km A = ]R[x, y]. 
From Lemma |2!2l we obtain that for all A; = 0, . . . , r, 

h G (Ci,Fi)A 

Since (Ci, Fi)A is minimally generated in degree > n (see Note ll.ll) . then we obtain 

/O = • ■ ■ = /n-l = 0. 

We are going to use these equations later when we switch the roles of x and z. For now let us 
consider k = n, . . . ,r. We have 

he{C,,F,) = {x^+\{x + yr+'):y'+\ 

and therefore 

(**) = Pkx'^' + Qkix + yr^\ 

where Pk = XlLo Pi,k-iX^y^~^ and Qk = Yl'^=o (li,k-iX^y''~^ are some homogeneous polynomials 
of degree k in A. 

Writing [x + yY~^^ = ^^=0 ('^t^)-^*^^^^ * identifying the coefficients of x^y"" in both the 
left and right-hand sides of (**), for each k = n, . . . ,r,we obtain: 

(1) For the monomial x^y^ with < m < /c we can determine the coefficients of fk from the 
coefficients of Qk, as follows: 



/ O,0,k,r-k \ 
0'l,k-l,r-k 

\ 0,kfl,r-k I 



( 1 





\ 


(-^) 







m ■■ 




: 




ail) ■■ 





V Qkfi J 
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(2) For the monomial x^y'" with /c + l<n<rwe obtain the following conditions on the coeffi- 
cients of Qfc: 



Vfc+2/ \k+l) 



CV) 



qi,k-i 







^ m K) ■■■ CI) j \ / Vo/ 

(3) For the monomial x'^y^ with u > r+1 the coefficient identifications will express the coefficients 
of Pk in terms of the coefficients of Qk. 

( Po,k \ I 1 C^) ■ ■ ■ (r+t-J \ / ?o,fc \ 

1 1 {^f) ■ ^ 



\ Pk,o / 



\r+l-(k-l)J 



\ 







Ql,k-1 



J \ Qkfi / 



These equations will not be useful for our computations. Basically they show how to create Pk 
from Qkto have (**) be valid. 

Let us consider our initial polynomial F expanded by the powers of x: 

{***) F = X^'go + x'^^Qi H h XQr-l + 5'r, 



where 

Qi = ar-i^i^y' + ar-i,i-i,iy'~^z H h flr-i.o.i^:* e B = R[z, y] 

are homogeneous of degree i. Using the fact that F E {C2, F2)R, Lemma 12.21 yields gi E 
{C2,F2)B. Since this ideal is minimally generated in degree n (see Note II. ll) we have = 
■ ■ ■ = Qn-i = 0. Equivalently, 



O'r-{n-l),n~l,0 — 0,r-(n-l),n-2,l — ■ ■ ■ — ar-(n-l),0,n-l — 0. 

These mean that for each k = n, . . . ,r, the last n+k—r of the coefficients Oj j ^ in (1) above must 
vanish. So for each /c = n, . . . , r, we obtain n + k — r more linear relations among the parameters 
Qafi, that combined with the relations already obtained in (2), yield that for each k = n, . . . ,r, the 

Ql,k—1 

vector ' . is in the kernel of a n x (A; + 1) matrix M. (k) . We will see this matrix in more 

\ Qkfi J 

detail later on when we study the cases r = 2n — 1 and r = 2n. 

What remains from (1) are the first (k + 1) — (n + k — r) = r — n + 1 of the coefficients aj,j,fc. 
So for all k = n, . . . ,r. 
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/ O'0,k,r-k \ 

Ql,fc-l,r-fc 

\ ^r—n,k—{r—n),r—k / 



(^l,n—l,r—n 
\ ^r—n,2n—r,r—n 



( 1 












o\ 


(t) 




1 












i' 

























kCD ( 


r+1 \ 
r-n-l) 








k = n, . . . , 


r 


we obtain 








\ 




/ 


QO,n 




ai,r-l,0 




= V- 




(ll,n-l 










dr—rijnfi 


I 




\q 


r~n,2n- 


r 



/ 10,k \ 

Ql,k-l 

\ Qi — n,k—{r—n) / 



Qr—n,n I 



where V is the lower-triangular matri)<ll in the previous vector equation. Denote with A the matrix 
to the left of the above equation and with Q the matrix of ga.b's- 



A = V-Q. 



At this moment we interchange the roles of x and z, and we start with F expanded by the powers 
of X, as we've seen in (***). At this point we mentioned that gi G (C2, F'ljB, and therefore for 
z = ra, . . . , r, 

Similarly as before 

y'^^g, = R,£^^^S,{x^yY^\ 

where Ri = J2k=o '^i-k,ky^~^z^ and Si = Yl\=o Si-k,ky^~^z'^ are some homogeneous polynomials 
of degree i'm B. 

With the same arguments as before (here we use the fact that /o = ■ ■ ■ = fn^i = 0), we obtain 
that for i = n, . . . ,r, the vector . ' is in the kernel of the same n x (i + 1) matrix 

\ So^i ) 

Furthermore, we have for alH = n, . . . , r, 



/ 



\ (^r—i,i—{r~-n),r—n / 



( 1 

rt;) 1 

1 . ,+1 X 

\ \r—nJ Vr— n— 1/ 



\ 





: 



■Sj-1,1 



i— (r— n) 



Combining these vector equations for alH = n, . . . , r we obtain 



A square matrix is called lower { upper) triangular if the entries above (below) the main diagonal are all zero. 
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( (^r—n,n,0 
(^r—n.n— 1,1 



\ ^r—n. 



2n—r,T—n 



J 



( Snfl 
\ ^2n—TA — 



■5r-l,l 



where V> is the same invertible matrix as before. Denote with B the matrix to the left of this matrix 
equation and with S the matrix of s^^^'s. 



B = V-S. 

Observe that the entries of B are the same as the entries of A, but in different positions. For 
example, the last column of B is the first row of A written backwards. This pattern is true for all 
the columns of B. In matrix form this relation can be expressed as 

J -A^ -J = B, 



where J is the exchange matrix 



Everything put together gives 







V 1 



\\ 

1 



0/ 



To summarize, we parameterized the elements in (Ci, -Fi)^ fl (C2, F2)r by two sets of parameters 
qa,h and s„ t,, both in the kernel K of the same matrix, and with (r — n + 1)^ relations among them 
given by the above matrix equation. As we will see in the next subsections, the real challenge is 
not to find dim K (we will use powerful results from representation theory to do this), but it is 
to answer the following question: given any two matrices S and Q satisfying the above matrix 
equation, can these two matrices be extended to two sets of parameters g^.fe and Su,v that are in the 
kernel 



To find the dimension of the kernel K of the matrix with diagonal blocks M.(n), . . . ,M.{r) 
one has to use relevant facts about Schur functors. We briefly recall some of these in the next 
subsection. We follow the nice exposition in [[6l, A.2.5, but we also refer the reader to [[91, Chapter 
1. 

3.1. Schur functors. Let y be a vector space over a field of characteristic 0. Let dim\^ = 
t. The finite dimensional representations of SLiy) decompose into a direct sum of irreducible 
representations. These summands are called Schur modules and can be viewed as functors (called 
Schur functors). To a sequence of numbers t > di > ■ ■ ■ > dg > one can associate the Schur 
module S^'^^'^^^'^^^V, which is a nontrivial, irreducible finite-dimensional representation of SL(y). 

Let F be a Young diagram containing s rows of boxes, row i having rfj boxes. Label the boxes 
in matrix notation; thus row i has the boxes labeled il, i2, . . . , idi. Set Ai = {il,i2, . . . , idi} and 
A = AiU A2U ■ ■ - U As. 
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Similarly, we label the columns of F as -Bi, . . . , B^^ . For example Bi = {11,21,. ..,sl}. Thus 
A = BiU---UBd,. 

For each index ij G A we consider a copy Vij = V, viewed as a representation of G = SLiV). 
For each set Ai (or Bj) as above, we can consider Sa,V, A^^V, Ta^V symmetric, exterior or tensor 
products of copies of V labeled by indices in (or Bj). 

For the sake of simplicity, assume C = {1,2,..., m} and let us recall that we have an embed- 
ding 

given via 

Vi A ■ ■ ■ AVc^ ^ sgn{a)v^(^i) (g) ■ ■ ■ (g) i;^(c), 

where Sc is the symmetric group on c letters. 

Thus considering the induced natural maps, let $ be the composition 

A^W ® • ■ ■ ® A^^V Ta,V ® • • ■ ® Ta^ = Tb,V (E)---®Tb,V ^ Sb,V ® ■ ■ • ® Sb,V. 

We set 5'{'^i> - ''^4 to be the image of $. The classical convention to create S^'^^''''''^''^V from the 
Young diagram Y is to take antisymmetric products on columns and symmetric product on rows. 
Observe that in the construction is reversed. 

The action of G = SL(y) on V extends naturally to the action of G on TcV for each c G N, by 
acting cr G G on each factor, and this action induces an action of G on 5'{^i. .'^4 

Example 3.1. S^'^^^^V . The Young tableau is 



We have Ai = {11, 12}, A2 = {21}, Bi = {11, 21}, B2 = {12}. So A = AiU A2 = BiU B2. 
$ is the composition 

A^W ® A^W ^Ta,V^ Ta,V ^ Tb,V ® Tb,V A Sb,V ^ Sb,V, 

where 

11 A f 12) (g V21) = f 11 (g f 12 <g V21 - V12 (g vii (g f2i, 

and therefore 

A V12 (g f2i) = vii ■ V21 (g V12 - V12 ■ V21 (g vu. 

vii ■ V21 and V12 ■ V21 are products in S2V. We have the fact that /m($) = S^'^'^^V. 
Skipping details that can be found in fl\. Chapter 1, Lecture Six, one can show that 

One idea of showing this is to embed everything in T3V, and use the conventions and the approach 
in llTl. This is based on showing that S^'^'^^V is the kernel of the map: 

p : AV ®V ^ AV, 

given by p{vi Av^^ V2) = f 1 A f2 A f 3. 
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Hence we obtain 

2 

In fact, in general the formula for dim S^'''^'^^^''^^^V is the determinant of the matrix M, where M 
is the s X s matrix with 

t 



(Observe that in Theorem A 2.9, there is a misprint in the entry Mg^s-i where it should read 

3.2. Finding dim K. We have to study the two cases: r = 2n — 1 and r = 2n. 

3.2.1. The case r = 2n — 1. If r = 2ri — 1, for k = n, . . . ,2n — 1 we have 

/ m„ m„_i ■ ■■ mi mo ■ ■ ■ \ 

m„+i m„ ■■■ m2 mi ttiq ■■■ 
M{k) = . . ... 

\ m2n-i m2„_2 ■ ■ ■ m„ m„_i m„_2 ■ ■ ■ m2n-i-fc / 

is an n X {k + 1) matrix. We denoted = (^") , £ = 0, . . . , 2n — 1. 
We also have 







Lemma 3.2. We have 



K = ®tnKer{M{k)). 
dimir="^" + ^^. 



2n-l 



2ri-l 



Proo/ First we have 

dimK = ^ dimi^'er(A^(/c)) = ^ (/c + 1 - rank{M{k))). 

k=n k=n 

One should notice the n x n block in the left part of M{k): 



AT 



\ (2ri-l) (2n-2) ' ' ' ( n ) / 

We have mi = m2n-e- From the above considerations, the determinant of this block is the 
dimension of the nontrivial, irreducible representation S^'^^'''''^"^V, Aj = n, of SL{V), where V is 
a vector space of dimension 2n. 

With this fact, since these representations are nontrivial, these determinants are never zero, and 
therefore, for all A; = n, . . . , 2n — 1, rank{M.{k)) = n. So 

n{n + 1) 



dim K = k 



k=l 



□ 
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3.2.2. The case r — 2n. Ifr — 2n, for A; = n, . . . , 2n we have 



Mik) 



( run+i rUn 
mn+2 rUn+i 

\ m2n rn2n-i 



mi mo 
m2 mi 





mo 



m„ m„_i m„_2 



is an n X {k + 1) matrix. Now, m^ = (^Y^) , ^ = 0, . . . , 2n. 
We have 

K^®ll^Ker{M{k)). 



Lemma 3.3. We have 



dim K 



(n + l)(n + 2) 



Proof. Consider the left-most n x n matrix block of the matrix M.{k) 

1+1 ) ' ' ' V 3 / 



(2n+l\ (2n+l\ 
V n+2 / V n+1 / 



/2n+l\ (2n+l\ (2n+l\ 
\ \ 2n ) \2n-l) " ' \ n+1 ) / 








m2n-k / 



As in the previous case, the determinant of this block is the dimension of the nontrivial repre- 
sentation 5'{'^i' -''^"}y, Aj = n + 1, of SL{y), where y is a vector space of dimension 2n + 1. 
Therefore, for all A; = n, . . . , 2n, rank{M.{k)) — n. 

We have 

dimi^ = ^dim Ker{M{k)) = Y^{k + 1 - rank{M{k))) = + + ^) _ 



k=n 



k=n 



□ 



3.3. Roth's equation and LU-decompositions. In this subsection we will discover how our prob- 
lem relates to LU-decompositions of matrices and to solving certain types of matrix equations. 
Again, we divide the analysis into the two cases: r = 2n — 1 and r = 2n, and we use the results in 
the previous two subsections: we have the n x n block denoted with jV^ that is invertible. 

3.3.1. The case r — 2n — 1. For each k — n, . . . ,2n — 1 we have 



M{k) ' 



Qn-l,k-n+l 
Qn,k—n 

V Qk,o ) 











Qn+lfi 

\ 



Q2n-1,0 / 
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Multiplying to the left the above equation by J\f^^, putting everything together, we obtain 

/mo ■ ■ ■ \ / 1n,0 qn.l ■ ■ ■ gn,n.-l \ 

\ TUn-l mn-2 " " " "^0 / 

Observe that the lower triangular matrix above is exactly the matrix V. Also denote with Q, the 
upper triangular matrix we see above. The entries of this matrix consist of all the Qa^h not occurring 
in Q. 

We obtain 

Q = -Af-'VQ. 
We have the same result for the parameters Su,v' 

S = -j\f-^vs. 

We had that VS = JQ^V^J. Denoting with S = VAf-^V, we have 

and since J' = = j'^ , denoting with U = J'S, we obtain 

US = {UQf. 

Our original problem translates into the following question regarding the solution of a certain 
type of matrix equation: 

Proposition 3.4. We have dim K(2n — 1) = n if and only if the matrix equation 

UX - Y^U'^ = C, 

has a solution consisting of two upper-triangular matrices for any n x n matrix C. 
Proof. The matrix equation has the desired solution if and only if the M— linear map 



Ti(n + 1) n(n + l) 



2 



given by (j){S, Q) = US — Q^U'^ , is surjective. □ 

But this is equivalent to the dimension of ker{(f)) ~ K{2n — 1) being equal to n{n + 1) — = 
n. □ 



3.3.2. The case r = 2n. For this case the same things occur. For each k = n, . . . ,2n we have 





(ln-l,k-n+l 
Qn,k—n 







^Here we have used the fact that an upper- triangular ji x n matrix is described by "^"^^^"i parameters. 
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Multiplying to the left the above equation by A/" ^ and putting everything together, we obtain 

/ qO,n ■ ■ ■ g0,2n \ / nil mo ■ ■ ■ \ f Qnfi " " " Qn^n \ 

• • • gi,2n-i . 1712 nil mo ■ • • 



V Qn- 



1,1 



'?l,2n-l 
Qn-l,n+l / 





V 



(ln+l,n-l 



\ nin nin-i nin-2 ■■■ ruo / 

The matrix to the left of the equality is an n x (n + 1) matrix consisting of the first n rows of the 
matrix Q we saw before. The missing row occurs as the first row in the upper-triangular matrix we 
see on the right. 

To correct this inconvenience, let J\f' be the (n + 1) x (n + 1) matrix 



Q2n,0 I 














and since mo = 1 observe that 



mo 








■ 


\ 


mi 


mo 





■ 




m2 


mi 


mo ■ 


■ 




rrin 


rrin-i 


mn~2 ■ 


■ mo 


/ 



( <?n,0 





V 



^Zn+l.Ti-l 
92n,0 / 



Observe that the lower triangular matrix above is exactly the matrix V>. Also denote with Q, the 
upper triangular matrix we see on the right side of the equality. 
We obtained 

Q = -Af'VQ. 

We have the same taking place for the parameters Su,v' 

S = -M'VS. 

We had that VS = JQ^V^J. Denoting with £ = VU'V, and with U = JS similarly to the 
case r = 2n — 1, we have 

US = {UQf. 
As for the other case, with a similar proof, we have: 

Proposition 3.5. We have dim K{2n) = n + 1 if and only if the matrix equation 

UX - Y^U'^ = C, 

has a solution consisting of two upper-triangular matrices for any (n + 1) x (n + 1) matrix C. 



The equations in the two propositions above are a particular case of Roth's equation (see 111311 '): 

AX-YB = C. 



This equation has solutions in X and Y if and only if the matrices 
have the same rank (see lfT2ll . Theorem 44.3, page 198). 



A 








B 



and 



A 


C 





B 
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Though in our case A = U and B = W^, and they are invertible matrices, our goal, and 
challenge, is to find a special type of solution: we need X to be upper-triangular and Y to be 
lower-triangular. The next lemma presents one instance when this goal is achieved. 

First, we say that an invertible matrix W admits an LU-decomposition if W has a decomposi- 
tion: W = VU, with V a lower-triangular matrix and U an upper-triangular matrix. It is known 
that for any invertible matrix W there exists a permutation matrix P such that PW has an LU- 
decomposition. 

Lemma 3.6. Let C be a p x p matrix, and W be an invertible p x p matrix that admits a LU- 
decomposition. Then there exist two upper triangular matrices X and Y , such that 

C = WX - Y^W^. 

Proof. We have the classical known properties of triangular matrices: (1) the transpose of an upper 
(lower) triangular matrix is a lower (upper) triangular matrix; (2) the inverse of an upper (lower) 
triangular matrix is an upper (lower) triangular matrix; (3) the product of two upper (lower) trian- 
gular matrices is an upper (lower) triangular matrix. 

We have W = VU. Let C = V-^C{V^)-^ and write C = C'^ - where C'^ is upper 
triangular and C[ is lower triangular (this decomposition is not unique). 

Let X = U'^C'Jv'^ and Y = U'^{C'i)'^V^ . Both X and Y are upper triangular and we have 

WX - Y^w^ = vuu-^cy^ - VC[{U^)-^U^V^ = VC'V^ = C. 

□ 

Based on possibly not enough experimentation, we ask the following question: 

Question 3.7. Let W be an invertible matrix. If for any square matrix C the equation C = 
WX — Y'^W^ has a solution consisting of two upper-triangular matrices, is it true that W has an 
LU -decomposition ? 

Regardless if the above question has an affirmative answer or not, one has the following classical 
criterion: an invertible matrix has LU-decomposition if and only if its leading principal minors are 
nonzero (see [i8j|, page 35). 

Now we go back to our problem. First, let us denote with Af the matrix J\f^^ (when r = 2n — 1), 
and also with Af the matrix J\f' (when r = 2n). Then our invertible matrix that we would like to 
show has LU-decomposition is 

U = JVAfV. 

Proposition 3.8. (1) U is symmetric. 
(2) JIAJ has LU-decomposition. 

Proof. To prove (1), observe that in both cases for r, the matrices JAf and JV are symmetric; 
this is true from the fact that multiplying a matrix to the left by the exchange matrix J, we reverse 
the rows in the matrix. If we multiply to the right, we reverse the columns. 
J"^ is the identity matrix, so Af^^J is also symmetric. Now, writing 

U = {JV){UJ){JV), 
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we obtain the result. 

To prove (2), observe that VJ = JV^ . With this we have 

JUJ = V{J\fJ)V^. 

It becomes enough to show that Mj has LU-decomposition, since V is lower-triangular and hence 
is upper-triangular. 

By the way we denoted what M is, it will be enough to show that {JM)^^ has LU- 
decomposition, or equivalently, that J'A/^has upper-lower decomposition (i.e., UL-decomposition). 

In terms of minors, we have to show that the minors of M starting from the north-east corner 
and moving down along the anti-diagonal to the south-west corner, are nonzero: 

rfet(A/'{i}x{„}) 7^ 0, (iet(A/'{i,2}x{n-i,n}) 7^ 0, ... , rfet(Ar{i,..„„}x{i,. ..,„}) ^ 0. 
Our matrix A/" is a submatrix of the Toeplitz matrix 

At = {aj_i)ij 

associated to the polynomial 

T(x) = (1 + xY^^ = ao + aix + ■ ■ ■ + a^+ix^"^"^. 

The polynomial T(x) has all roots real numbers, equal to —1 < 0, and has coefficients > 0. 
From a theorem of Aissen-Schoenberg-Whitney from 1952 (see IfTTI . Theorem 4.5, page 105), this 
will imply that all the minors of At, and therefore of A/", are strictly larger than zeroO 

In particular, the minors of our interest are different than zero, and hence the claim. □ 

Though Proposition [32] (2) does not resolve our problem, it gives lower-triangular solutions for 
our equation 

UX - Y^U = C. 
Part (1) of Proposition [X8] allows us to replace by U. 

Corollary 3.9. For any square matrix C, the equation 

UX - Y^U = C 
has a solution consisting of two lower-triangular matrices X and Y. 

Proof. From Proposition 13. 8[ for any square matrix C, there exist two upper-triangular matrices 
X' and Y' such that 

{JUJ)X' - {Y'fiJUJ) = JCJ. 
Then X = JX' J and Y = JY' J are lower-triangular solutions of the equation 

UX - Y^U = C. 

□ 

Is the corollary above enough to show that dim K{2n — 1) = n and dim K{2n) = n + 11 

Acknowledgements: We would like to thank the anonymous referee for the useful corrections, 
suggestions and comments. 

^This also gives an alternative proof that det{J\f) ^ 0, and so A/" is invertible. 
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